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FIELD OF THE INVENTION 

The invention relates generally to digital integrated circuits and, more particularly, to 
reducing leakage current in power-saving standby modes of digital integrated circuit 
operation. 

BACKGROUND OF THE INVENTION 

The demand for higher clock-rates and lower power supply voltages in digital 

integrated circuits such as CMOS circuits results in rapidly increasing levels of standby 

leakage current (i.e., the current consumed by a CMOS circuit when the clock is not active). 

As an example, multi-million gate LC.s operating in the Gigahertz region with supply 

voltages below 1.5V can have standby leakage of 100mA or higher. This level of leakage 

current represents a significant problem for portable (battery operated) applications. This 
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problem is conventionally addressed by introducing products that feature state-retentive, 
low-leakage standby modes. 

Most conventional state retention schemes collapse the power supplies partially or 
entirely while maintaining the state of all the register elements. Upon restoration of the 
5 power, all circuit nodes return to their previous state since all nodes can be derived directly 

from the state of the register elements. 

In conventional power-down applications, power supply to circuits can be turned off 
in order to reduce the standby power consumption. The state retention flip-flops that store 
the status of operation of the circuit need to store the data in latches with low leakage current 

1 0 during power-down phase. The retention latch (called a shadow latch hence forth) is formed 

with thick-oxide (low leakage) transistors and is powered by a separate permanent power 
supply to retain data in power-down or retention mode. Some disadvantages of such a 
scheme are: the additional circuitry required for such a shadow latch implementation is 
magnified at chip level when a large number of retention flip-flops are required; generation 

15 of additional control signals required to drive the flip-flop in and out of retention mode not 

only increase the flip-flop area, but also pose routing problems at block level; and additional 
shadow latch and control circuitry can load the speed-critical path of the flip-flop worsening 
the propagation delay of the flip-flop. 

As mentioned above, some conventional approaches use retention registers (including 

20 shadow latches) to retain state while lowering the device leakage. One such scheme has two 
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supplies, a permanent supply for retention and a virtual supply for conventional logic power. 
Low V t (leaky) devices are powered by the virtual supply, while High-V t (low leakage) 
devices are used for retention and powered from the permanent supply. This architecture has 
limitations. Both supplies must be present for normal operation. This introduces a physical 
design overhead of routing an additional power rail to all the registers. Also, minimum 
operating voltage is limited by the High-V t devices, effectively prohibiting conventional 
Vbox-min testing. 

It is desirable in view of the foregoing to provide state retention registers which avoid 
the aforementioned disadvantages of conventional approaches. The various disadvantages of 
the conventional approaches can be avoided by various exemplary embodiments of the 
present invention, wherein: a differential circuit is used to load the shadow latch from the 
normal functional latch; the signal used to restore data from the shadow latch to the normal 
functional latch is a "don't care" signal while the shadow latch is retaining the data during 
low-power standby mode; retained data from the shadow latch is restored to the normal 
functional latch via a transistor gate connected to a node of the shadow latch where the 
retained data is provided; a power supply other than the shadow latch's power supply powers 
the data restore operation; and the normal functional latch is operable independently of the 
operational states of the high V t transistors used to implement the state retention 
functionality. In addition, an isolation apparatus is provided to retain an output of a logic 
module while the logic module is powered-down. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 diagrammatically illustrates an example of a logic block with a low- 
power, standby mode according to the invention. 

FIGURE 2 diagrammatically illustrates exemplary embodiments of a flip-flop with 
5 state retention capability for use in a state retention register according to the invention. 

FIGURE 3 diagrammatically illustrates further exemplary embodiments of a flip-flop 
with state retention capability for use in a state retention register according to the invention.. 

FIGURE 4 diagrammatically illustrates pertinent portions of exemplary embodiments 
of a wireless communication apparatus according to the invention. 
1 0 FIGURE 5 is a timing diagram which illustrates exemplary operations of the power 

state controller of FIGURE 4. 

FIGURES 6, 6 A and 7 diagrammatically illustrate exemplary schemes for distribution 
of control signals used by state retention circuitry according to the invention. 

FIGURE 8 diagrammatically illustrates further exemplary embodiments of a flip-flop 
1 5 with state retention capability for use in a state retention register according to the invention. 

FIGURE 9 diagrammatically illustrates further exemplary embodiments of a flip-flop 
with state retention capability for use in a state retention register according to the invention. 

FIGURE 10 diagrammatically illustrates further exemplary embodiments of a flip- 
flop with state retention capability for use in a state retention register according to the 
20 invention. 
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FIGURE 1 1 diagrammatically illustrates a plurality of logic modules which can be 
selectively powered-down. 

FIGURE 12 diagrammatically illustrates exemplary embodiments of a retention 
apparatus for retaining an output of a logic module while the logic module is powered-down. 
5 FIGURE 1 3 is a timing diagram which illustrates further exemplary operations which 

can be performed by the power state controller of FIGURE 4. 
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DETAILED DESCRIPTION 

FIGURE 1 diagrammatically illustrates a power switching arrangement according to 
exemplary embodiments of the invention. As shown in FIGURE 1, a suitable transistor is 
provided as a header switch for selectively (in response to the signal UP/DN) connecting and 
5 disconnecting the module level power supply VDD to and from the chip level (permanent) 

power supply VCC. The module level power supply VDD provides operating power for a 
logic module that includes state retention registers according to the invention. VDD is 
connected to VCC when UP/DN is activated, and is disconnected from VCC when UP/DN is 
inactivated. 

1 0 FIGURE 2 diagrammatically illustrates exemplary embodiments of a flip-flop with 

state retention capability for use in a state retention register according to the invention. The 
flip-flop of FIGURE 2 is a multi-threshold CMOS (MTCMOS) flip-flop. This MTCMOS 
flip-flop includes normal core transistors which are used throughout the logic module and 
have a first gate oxide thickness, and also includes additional transistors having a thicker 

1 5 gate oxide (and correspondingly less leakage) than the normal core transistors. These thick 

gate oxide transistors are used to implement the state retention functionality of the flip-flop. 
The inverters connected back-to-back between the nodes Nl 0 and Nl 1 form a shadow latch 
for retaining data while power is removed from the normal functional (in this example, DQ) 
flip-flop circuitry. These inverters are formed with thick oxide (low leakage) transistors and 

20 are powered by a separate power supply VRETAIN, which is produced from the permanent 
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power supply VCC (see also FIGURE 1). As examples, VRETAIN can be produced by a 
VRETAIN power supply in some embodiments, and can be connected to VCC in other 
embodiments (shown by broken line in FIGURE 1). The remainder of the FIGURE 2 flip- 
flop is powered by the module level power supply VDD, which also powers the other logic 
5 of the logic module in which the state retention register resides. The entire flip-flop of 

FIGURE 2, including the shadow latch, shares a common ground with the remainder of the 
associated logic module. 

The complementary clock signals CLK and CLKZ are used in conventional fashion 
to operate the normal functional flip-flop circuitry. A save signal SAVE and restore signal 
1 0 REST (and its inverse RESTZ) are used to transition the FIGURE 2 flip-flop between active 

and state retention modes of operation. During the active mode of operation (i.e., the normal 
functional flip-flop operation), the SAVE and REST signals are maintained at a logic zero 
level. 

Prior to disconnecting VDD from VCC (see also FIGURE 1), the data stored in the 
1 5 normal functional flip-flop circuit must be saved into the shadow latch. To accomplish this, 

the SAVE signal is strobed high, thereby activating a differential pull-down network at Ml , 
M2 and M3. The pull-down network includes a pair of legs respectively connected to the 
complementary storage nodes at the input and output of the inverter 21 of the slave latch of 
the normal functional flip-flop. Depending on the logic state of the normal functional flip- 
20 flop, one of the legs of the differential pull-down network is activated to save the data into 
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the shadow latch. The pull-down network can be designed to fight off the weak thick-oxide 
PMOS transistors of the shadow latch. In some embodiments, the transistors at Ml , M2 and 
M3, and the transistors of the shadow latch are sized in order to ensure that the contents of 
the normal functional flip-flop are written into the shadow latch at the worst case process 
5 corner scenario. In the worst case scenario, the NMOS transistors Ml and M2 in the weak 

process corner need to fight off the associated PMOS transistors of the shadow latch in the 
strong process corner. The flip-flop of FIGURE 2 can be made even more robust and 
reliable in some embodiments by designing for the aforementioned worst case process comer 
scenario at low temperature and low power supply voltage. 

1 0 After the SAVE signal has been strobed and the data from the normal functional flip- 

flop has been stored into the shadow latch, the FIGURE 2 flip-flop is ready to go into 
retention mode. The header switch of FIGURE 1 is used to cut off VDD from VCC, so all 
nodes in the logic module that are powered by VDD decay to almost zero volts. However, 
the shadow latch is still powered by the separate power supply VRETAIN, so the data is 

1 5 retained in the shadow latch. In order to reduce the leakage current while the shadow latch is 

retaining data with the normal functional flip-flop circuitry powered-down, the shadow latch 
data storage nodes Nl 0 and Nil should be isolated from the powered-down circuitry. This 
is accomplished by thick oxide transistors Ml , M2, M5 and M6. The transistor stack at M4 
- M7 switchably connects the state retention storage node N10 to the node N8 of the normal 

20 functional flip-flop. The use of this transistor stack arrangement rather than, for example, a 
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pass gate arrangement between node Nl 1 and node N8, permits the storage node N10 to be 
connected to the gates of transistors M5 and M6, rather than having the storage node Nl 1 
connected to the sources (or drains) of a pass gate arrangement. Connection of node N10 to 
the gates of thick oxide transistors M5 and M6 advantageously reduces the possibility of 
5 current leakage from the shadow latch. 

The complementary signals REST and RESTZ are used to restore the data from the 
shadow latch to the normal functional flip-flop. As VDD is re-connected to VCC (see also 
FIGURE 1), the signal REST is driven to a logic 1 voltage. This activates thin oxide 
transistors M4 and M7 to power the inverter at M5 - M6, which creates a read-back path 

10 from the shadow latch to the slave latch of the normal functional flip-flop for restoring the 

retained data. Also, when the signal REST is driven to a logic 1 voltage, the thin oxide 
transistors at M8 and M9 disable the feedback path of the slave latch of the normal 
functional flip-flop. Upon re-connection of VDD to VCC, only the shadow latch drives node 
N8, via the inverter at M4 - M7. This ensures that the node N8 is restored to the voltage that 

1 5 was present there prior to disconnecting VDD from VCC. After the shadow latch has driven 

node N8 to its previous voltage, VDD is re-connected to VCC, after which the REST signal 
is returned to logic zero. At this point, the shadow latch is isolated from the normal 
functional flip-flop portion of FIGURE 2, which is now ready to resume its normal 
functional DQ flip-flop operation. 
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FIGURE 3 diagrammatically illustrates pertinent portions of further exemplary 
embodiments of a state retention flip-flop arrangement for use in a state retention register 
according to the invention. FIGURE 3 illustrates a flip-flop in which the thin oxide 
transistors M4 and M7 drive node N8, while thick oxide transistor M5 is connected between 
5 VDD and M4, and thick oxide transistor M6 is connected between M7 and ground. The gate 

signals controlling transistors M4 - M7 are the same as illustrated in FIGURE 2. The 
remainder of the state retention flip-flop can be otherwise identical to the structure shown in 
FIGURE 2, and the arrangement of FIGURE 3 operates in generally the same fashion 
described above with respect to FIGURE 2. 

1 0 FIGURE 8 diagrammatically illustrates further exemplary embodiments of a flip-flop 

with state retention capability for use in a state retention register according to the invention. 
In FIGURE 8 (and in FIGURES 9 and 10 also), reference characters Ml, M2, etc. from 
FIGURES 2 and 3 are re-used but, as can be seen from the drawings, they do not necessarily 
refer to the same types of transistors (NMOS or PMOS, thick or thin oxide) to which they 

15 refer in FIGURES 2 and 3. The flip-flop of FIGURE 8 is a multi-threshold CMOS 

(MTCMOS) flip-flop. As in FIGURE 2, the flip-flop of FIGURE 8 includes normal core 
transistors which are used throughout the logic module and which have a first gate oxide 
thickness, and also includes additional transistors having a thicker gate oxide (and 
correspondingly less leakage) than the normal core transistors. These thick gate oxide 

20 transistors are used to implement the state retention functionality of the flip-flop. The 
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inverters connected back-to-back between nodes N10 and Nil form a shadow latch for 
retaining data while power is removed from the normal functional flip-flop circuitry, which 
normal functional flip-flop circuitry can be, for example, generally the same as described 
above with respect to FIGURE 2. The inverters between nodes N10 and Nil are formed 
5 with thick oxide (low leakage) transistors and are powered by the separate power supply 

VRETAIN described above with respect to FIGURES 1 and 2. The remainder of the 
FIGURE 8 flip-flop is powered by the module level power supply VDD, which also powers 
the other logic of the logic module in which the state retention register resides. As in 
FIGURE 2, the entire flip-flop of FIGURE 8, including the shadow latch, shares a common 

10 ground with the remainder of the associated logic module. As in FIGURE 2, the SAVE 

signal and the REST signal are used to transition the flip-flop between active and state 
retention modes of operation. During the active (normal) mode of operation, the SAVE and 
REST signals are maintained at a logic zero level. 

As in FIGURE 2, prior to disconnecting VDD from VCC (see also FIGURE 1), the 

1 5 data stored in the normal functional flip-flop circuit must be saved into the shadow latch. To 

accomplish this, the SAVE signal is strobed high, thereby activating a differential pull-down 
network which includes transistors Ml , M2, M3 and M4. The pull-down network includes a 
pair of legs respectively connected to the complementary storage nodes at the input and 
output of the inverter 21 of the slave latch of the normal functional flip-flop. Depending on 

20 the logic state of the normal functional flip-flop, one of the legs of the differential pull-down 
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network is activated in response to the SAVE signal, in order to save the data from the 
normal functional flip-flop into the shadow latch. The pull-down network can be designed to 
fight off the weak thick-oxide PMOS transistors of the shadow latch. In some embodiments, 
the transistors Ml -M4 and the transistors of the shadow latch are sized in order to ensure that 
5 the contents of the normal functional flip-flop are written into the shadow latch at the worst 

case process corner scenario. In the worst case scenario, the NMOS transistors M3 and M4 
in the weak process corner need to fight off the PMOS transistors of the shadow latch in the 
strong process corner. The flip-flop of FIGURE 8 can be made even more robust and 
reliable in some embodiments by designing for the worst case process corner scenario at low 

10 temperature and low power supply voltage. 

After the SAVE signal has been strobed and the data from the normal functional flip- 
flop has been stored into the shadow latch, the FIGURE 8 flip-flop is ready to go into 
retention mode. The header switch of FIGURE 1 is used to cutoff VDD from VCC, so all 
modes in the logic module that are powered by VDD decay to almost 0 volts. However, the 

1 5 shadow latch is still powered by the separate power supply VRETAIN (not explicitly shown 

in FIGURE 8), so the data is retained in the shadow latch. In order to reduce the leakage 
current while the shadow latch is retaining data with the normal functional flip-flop circuitry 
powered-down, the shadow latch data storage nodes N10 and Nl 1 should be isolated from 
the powered-down circuitry. This is accomplished by thick oxide transistors M3, M4, M5 

20 and M6. The transistors M5, M6, M7 and M8 form a differential pull-down structure which 
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permits restoration of the data stored at nodes N10 and Nil to nodes N7 and N8, 
respectively. The use of this pull-down network rather than, for example, a pass gate 
arrangement between the nodes that N10, Nl 1 and the nodes at N7, N8, permits the storage 
nodes at N10 and Nl 1 to be connected to the gates of transistors M6 and M5, rather than 
5 having the storage nodes N10 and Nl 1 connected to the sources (or drains) of a pass gate 

arrangement. Connection of nodes N10 and Nl 1 to the gates of the thick oxide transistors 
M6 and M5, respectively, advantageously reduces the possibility of current leakage from the 
shadow latch. 

The REST signal is used to restore the data from the shadow latch to the normal 
10 functional flip-flop. Before VDD is re-connected to VCC (see also FIGURE 1), the signal 

REST is driven to a logic 1 voltage. This activates the pull-down network at M5-M8, and 
transistors M9 and Ml 0 provide positive feedback to latch the data that has been retained in 
the shadow latch. This creates a read-back path from the shadow latch to the slave latch of 
the normal functional flip-flop for restoring the retained data. Also, when the signal REST is 
1 5 driven to a logic 1 voltage, the thin oxide transistor at M 1 1 disables the feedback path of the 

slave latch of the normal functional flip-flop. Upon re-connection of VDD to VCC, only the 
shadow latch drives the nodes N7 and N8. This ensures that the nodes N7 and N8 are 
restored to the respective voltages that were present there prior to disconnecting VDD from 
VCC. After the shadow latch has driven the nodes N7 and N8 to their previous voltages, 
20 VDD is re-connected to VCC, after which the REST signal is returned to logic 0. At this 
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point, the shadow latch is isolated from the normal functional flip-flop portion of FIGURE 8, 
which is now ready to resume its normal functional DQ flip-flop operation. 

FIGURE 9 diagrammatically illustrates further exemplary embodiments of a flip-flop 
with state retention capability for use in a state retention register according to the invention. 
5 In the arrangement of FIGURE 9, the normal functional flip-flop circuitry is a negative edge- 

triggered design, rather than a positive edge-triggered design as illustrated in FIGURES 2 and 
8. In this situation, the state retention circuitry is connected as shown to nodes N2 and N3 of 
the normal functional flip-flop circuitry in order to permit the save and restore operations to 
be performed with respect to the master latch of the normal functional flip-flop circuitry. 

10 That is, data from the master latch of the normal functional flip-flop circuitry can be saved 

into the shadow latch, and the data saved into the shadow latch can be restored to the master 
latch of the normal functional flip-flop circuitry. The state retention circuitry of FIGURE 9, 
namely the shadow latch, the SAVE pull-down network M1-M4, and the RESTORE pull- 
down network M5-M10 can, in some embodiments, be the same as described above with 

1 5 respect to FIGURE 8. In FIGURE 9, when the REST signal is driven to a logic 1 voltage, a 

thin oxide transistor 91 disables the feedback path of the master latch of the normal 
functional flip-flop. 

FIGURE 10 diagrammatically illustrates further exemplary embodiments of a flip- 
flop with state retention capability for use in a state retention register according to the 
20 invention. The flip-flop of FIGURE 10 employs a clock free retention scheme which 
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permits the state retention function to be performed independently of the clock input CLK. 
This is useful for a flip-flop whose clock input is unknown upon power up, for example a 
flip-flop whose clock input is derived from the data output of another flip-flop. The flip-flop 
of FIGURE 10 is generally similar to the flip-flop of FIGURE 8, with the addition of 
5 transmission gate TG3 between node N3 of the master latch and node N9 defined at the 

connection point of series connected transistors M8 and M10, and transmission gate TG4 
connected between node N9 and node N8 of the slave latch. The SAVE operation of the 
flip-flop of FIGURE 10 can be the same as described above with respect to FIGURE 8. 
When the REST signal is activated, node N7 is restored irrespective of the CLK state. Also, 

10 node N9 gets restored. Then, depending on whether CLK is high or low, node N9 drives 

either node N3 of the master latch (when CLK is high) or node N8 of the slave latch (when 
CLK is low). If CLK is low, then data is restored to node N8 to complete the loop in the 
slave latch. If CLK is high, then node N7 drives nodes N2 and N4 through transmission 
gates TGI and TG2, and node N9 drives node N3 through transmission gate TG3. This 

1 5 completes the loop for the master latch. 

In some exemplary embodiments, the transistors M9 and M10 each can have a 0.2 
um width and a 0.4 um length. The flip-flop of FIGURE 10 also includes the transistor 91 
described above with respect to FIGURE 9, so that activation of the REST signal disables the 
feedback path of the master latch. 

-15- 

3216332vl 



Patent Application 
Docket No. TI-34822 

FIGURE 4 diagrammatically illustrates pertinent portions of exemplary embodiments 
of a wireless communication apparatus according to the invention. The wireless 
communication apparatus of FIGURE 4 includes an antenna structure 41 for permitting 
communication via an air interface 42. A data processing apparatus 43 can perform data 
5 processing operations related to the communications on air interface 42. A wireless 

communication interface can utilize conventional techniques to interface the data processing 
apparatus 43 to the antenna structure 41. A user interface 44 can use conventional 
techniques to interface the data processing apparatus 43 to a user of the wireless 
communication apparatus. 

1 0 The data processing apparatus 43 includes a plurality of logic modules that include 

logic for performing data processing operations, and state retention registers for storing data 
associated with the data processing operations. These registers are constructed from 
corresponding pluralities of state retention flip-flops, for example the state retention flip- 
flops illustrated in FIGURES 2, 3 and 8-10. The data processing apparatus 43 further 

1 5 includes a power state controller 45 which, in some embodiments, utilizes state machines to 

provide appropriate control signals to the header switches and logic modules. As shown in 
FIGURE 4, the power state controller provides control signals UP/DN to the respective 
header switches, and provides control signals REST, RETZ (described hereinbelow) and 
SAVE for distribution to each of the logic modules. The power state controller 45 activates 
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these control signals appropriately to implement the exemplary operations described above 
with respect to FIGURES 2, 3 and 8-10. 

FIGURE 5 is a timing diagram which illustrates the timing relationships of the 
control signals produced by the power state controller 45. Note that the VDD waveform of 
5 FIGURE 5 generally timewise corresponds to activation (VDD on) and inactivation (VDD 

off) of the UP/DN signal of FIGURE 4. 

In some embodiments, high level control logic 46 orders the power state controller 45 
to implement the low-power standby (state retention) mode of operation, whereupon the 
power state controller 45 can perform the exemplary signalling described above to 
1 0 implement the standby mode and then report this back to the high level control logic 46. The 

logic 46 and controller 45 can be powered by VRETAIN in some embodiments. 

The wireless communication apparatus of FIGURE 4 can be, for example, a cellular 
or other mobile telephone, a laptop computer, a personal digital assistant, etc. In some 
embodiments, the data processing apparatus 43 is provided as a single integrated circuit such 
15 as a microprocessor, microcontroller or digital signal processor. 

Referring again to FIGURES 2 and 8-10, the SAVE signal must be guaranteed to be 
low during state retention. In some embodiments, SAVE is distributed using a buffer tree 
powered by the retention supply VRETAIN (also referred to herein as VRET). In these 
embodiments, the buffer tree that distributes SAVE to the state retention circuitry of a given 
20 logic module includes a plurality of buffer cells buried in a region of the integrated circuit 
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where the associated logic module is located. These buffer cells are interconnected 
appropriately to route SAVE to the state retention circuitry. In some embodiments, each 
buffer cell is located directly under un-switched VDD metal, that is, a metal layer connected 
to VRET. Each buffer cell is connected to the un-switched VDD metal by a vertical stack of 
5 all metal and via layers. An example of this arrangement is illustrated generally in FIGURE 

6A. 

Referring now to FIGURE 6A, an exemplary buffer cell includes thick-oxide 
transistors 610 and 620 interconnected at 630 and 640 to form an inverter. A metal layer 
MET6 connected to the retention voltage VRET is also connected to the transistor 620 (e.g., 

10 a PMOS transistor) by a vertical stack of all metal and via layers illustrated generally at 650. 

The vertical stack at 650 extends between the metal layer MET6 and a further metal layer 
MET1 . The metal layer MET1 provides connectivity to the gates, sources and drains of the 
transistors 610 and 620. The vertical stack at 650 connects VRET to the transistor 620. The 
N-well 660 that contains the transistor 620 is electrically isolated from adjacent cells as 

15 illustrated generally at 670. In the arrangement of FIGURE 6 A, the N-well 660 observes a 

570nm spacing rule relative to the N-wells in adjacent buffer cells. 

In FIGURE 6A, the vertical stack 650 is routing track-centered on horizontal track 5. 
Further optimization of the horizontal alignment of vertically adjacent buffer cells is used in 
some embodiments to prevent long-run jogging in the routes of metal layers between layers 

20 MET1 and MET6. In some embodiments, minimum area rules with respect to the metal in 
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the metal layers are observed in a manner that does not block more than one track in the 
preferred routing direction. This can reduce the impact of the vertical stack 650 on 
routability. 

The buffer cell arrangement illustrated in FIGURE 6A permits the SAVE signal and 
5 the RETZ signal (described hereinbelow) to be distributed throughout a given logic module 

by a buffer tree which has the same general structure as illustrated in FIGURE 7, but which 
is powered by the retention power supply VRET. 

Some embodiments use a VDD-powered buffer tree to distribute SAVE (and/or 
RETZ). In such embodiments, because VDD is removed from the logic module during state 
10 retention, only a single inverter can be placed between the power state controller 45 (see 

FIGURE 4) and the state retention circuitry of the logic module. This is illustrated generally 
in FIGURE 6. The power state controller 45 produces SAVE' or RET (respective inverses 
of SAVE and RETZ) for the FIGURE 6 arrangement. If the signal SAVE' (or RET) is high, 
the SAVE (or RETZ) signal will remain low (as desired), even while the inverters of 
1 5 FIGURE 6 are not receiving power. 

The exemplary embodiments described above provide numerous advantages, some 
examples of which follow. Only 8 of the larger thick oxide transistors are needed for each 
state retention flip-flop: four transistors in the shadow latch; two transistors for writing to 
the shadow latch; and two transistors for reading the shadow latch. Only 2 thick oxide 
20 PMOS transistors are contained in separate N-wells, namely the PMOS transistors of the 
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shadow latch inverters (see also FIGURES 2 and 8-10). Current leakage during state 
retention is reduced, because only thick oxide (low leakage) transistors remain biased during 
state retention. 

The REST signal can be undefined during the state retention mode. This 
5 advantageously permits use of a fully active buffer tree (powered by VDD) to distribute the 

REST signal to the state retention registers, as illustrated generally in FIGURE 7. The REST 
signal can therefore propagate very quickly (for example in a few nanoseconds) when VDD 
is re-connected to VCC, so the data from the shadow latch can be restored into the normal 
functional flip-flop very quickly, for example in around 100 nanoseconds. With a restoration 

10 time in the 100 nanoseconds range, a logic module can be placed into the low-power state 

retention mode at any time, because the restoration operation happens quickly enough to be 
undetectable by system software or hardware. Thus, the state retention mode is transparent 
to the data processing system. 

Transistors M4 and M7 of FIGURES 2 and 3, and transistors M7-M10 of FIGURES 

15 8- 1 0 do not draw switching current from the VRETAIN power supply during state retention 

or during the transition from state retention to normal operation. This advantageously 
permits the VRETAIN power supply voltage to be routed as a conventional logic signal (e.g., 
to be routed to many registers in parallel as shown in FIGURE 7), thus eliminating the need 
for a conventional power grid to distribute VRETAIN. If routed as a logic signal, the 

20 VRETAIN power supply can collapse when the SAVE signal is asserted, but sufficient time 
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can be allowed for VRETAIN to return to its DC level before the SAVE signal is de- 
asserted. Although this increases the time required to perform a state save operation, the 
state save operation does not limit the system response time. 

All of the thick oxide (high V t , low leakage) transistors of FIGURES 2, 3 and 8-10 
5 can be inoperable (i.e., at any level of conductance/resistance) without affecting the normal 

operation of the normal functional flip-flop. This advantageously permits conventional 
Vbox-min testing at low V t levels, even though operation of the high V t transistors is 
unpredictable at such low V t levels. 

Referring again to FIGURE 4, in some applications, it may be desirable to power- 

10 down one or more of the logic modules independently of the remaining logic modules, in 

order to reduce current consumption. In these situations, the output signal voltages of the 
powered-down logic module(s) should be maintained in order to ensure that any powered-up 
logic module that is driven by an output signal of a powered-down logic module will not be 
affected by the powering-down of the driving logic module(s). 

1 5 FIGURE 1 1 diagrammatically illustrates the need to maintain output signals from a 

powered-down logic module. In FIGURE 11, modules A, B and C are powered by 
respectively different power supplies, VDD A , VDD B and VDDc, respectively. VRET (also 
referred to hereinabove as VRETAIN) is the retention power supply, which is common to all 
three of the logic modules. If only module A will be powered-down (by removal of VDD A ), 

20 then the signal voltage levels at its outputs should be maintained in order to permit continued 
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operation of modules B and C. As shown in FIGURE 1 1 , each output of module A can have 
associated therewith an apparatus S for maintaining the associated output signal voltage 
while module A is powered-down. 

FIGURE 12 diagrammatically illustrates exemplary embodiments of an apparatus S 
5 for maintaining an output voltage signal level of a logic module that has been powered-down 

(see module A of FIGURE 1 1). In FIGURE 12, reference characters Ml, M2, etc. from 
FIGURES 2, 3 and 8-10 are re-used, but, as can be seen from the drawings, they do not 
necessarily refer to the same types of transistors (NMOS or PMOS, thick or thin oxide) to 
which they refer in FIGURES 2, 3 and 8- 1 0. The input node IN of the apparatus of FIGURE 

10 12 can be connected, for example, to the Q output of any of the flip-flops in FIGURES 2, 3 

and 8-10. During normal powered-up operation of the logic module A, the series-connected 
inverters Ml, M2, and M5, M6 form a driver that buffers the signal from node IN to the 
output node OUT. This driver can be selectively disabled by using transistors M3, M4 and 
M7 to disconnect the second stage inverter M5, M6 from its power supply inputs, namely 

1 5 VDD A and ground (VSS). The shadow latch and differential pull-down network illustrated 

generally at 1 2 1 in FIGURE 1 2 can be the same as the corresponding structure in FIGURES 
8-10. As shown, the data inputs to the structure 121 are the signal at the input node IN of the 
first stage inverter Ml, M2 and the signal at the output node INZ of the first stage inverter 
Ml , M2. The SAVE signal of FIGURE 12 can be the same as described above with respect 

20 to FIGUREs 2, 3 and 8-10. The transistors of the shadow latch inverters and the transistors 
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Ml 2 and Ml 4 of the pull down network are thick oxide transistors, and the transistors of the 
shadow latch inverters are powered by the retention power supply VRET. 

The output node 122 of the shadow latch drives the input of an inverting driver stage 
M8-M1 1 . In particular, transistors M9 and Ml 0 form an inverter between node 122 and the 
5 OUT node. The transistors M8 and Mil provide the capability of selectively disabling the 

inverter M9, M10 by disconnecting it from its power supply inputs, namely VRET and 
ground. All of the transistors M8-M1 1 are thick oxide transistors. 

During normal, powered-up operation of the logic module A, the signal RET is low 
and the complementary signal RETZ is high. Under these conditions, the parallel 
10 combination of transistors M3 and M4, together with transistor M7, provide a connection 

between the second stage inverter M5, M6 and its power supply inputs, namely VDDa and 
ground. During retention mode, when RET is high and RETZ is low to disable the second 
stage inverter, there may be some leakage through transistor M3. In order to reduce the 
impact of such leakage, the width-to-length (W/L) ratio of transistor M3 can be selected to 
1 5 be relatively small, for example, in a range from about 3 to about 10. Conversely, because 

leakage during retention mode is not a problem with respect to the thick oxide transistor M4 
(or M7), the width-to-length ratio of M4 (and M7) can be relatively large, for example in a 
range from about 30 to over 100, in order to increase speed 

Transistor M3 has a lower V t than does transistor M4, thereby permitting normal 
20 operation of the arrangement of FIGURE 1 2, even when VDD A is very low, for example, as 
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low as 0.7 volts. Thus, the arrangement of FIGURE 12 can be subjected to Vbox-min testing 
at low V t levels, even though operation of the high V t transistor M4 is unpredictable at low 
V t levels. 

After the data signal defined at IN and INZ is latched into the shadow latch at 121 by 
5 strobing the SAVE signal high, the signal RETZ is taken low, in order to invoke the retention 

mode of operation. The strobing of SAVE also latches the data signal into an internal 
shadow latch within logic module A, for example, the shadow latch of one of the state 
retention flip-flops described above relative to FIGURES 2, 3 and 8-10. With RETZ low and 
its complement RET high, the second stage inverter M5, M6 is disabled, and the output 
10 inverter driver M9, M10 is enabled, thereby providing the contents of the shadow latch at 

121 to an input of another powered-up logic module, such as module B or module C of 
FIGURE 11. 

After the data signal from the logic module A is restored at node IN (by operation of 
the restore signal REST in the corresponding state retention flip-flop of module A), then the 
15 signal RETZ can be taken high again, thereby disabling the inverter M9, M10 and enabling 

the inverter M5, M6 to re-institute normal output operation of module A. 

FIGURE 13 is a timing diagram which illustrates exemplary operations (described 
above) that can be performed by the power state controller 45 of FIGURE 4 in order to 
control operation of the apparatus of FIGURE 12. In some embodiments, the SAVE signal 
20 and the restore signal REST can be produced and distributed in the same manner described 
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above with respect to FIGURES 2- 1 0, and RETZ can be produced and distributed in the same 
manner as SAVE. In the example of FIGURE 13, the restore signal REST goes high after 
VDD A comes back up, whereas FIGURE 5 illustrates the restore signal REST going high 
somewhat before VDD comes back up. This distinction is not operationally significant 
5 because, as discussed above with respect to FIGURE 7, the restore signal REST is 

distributed within a given logic module by a VDD-powered buffer tree. So, even if the 
power state controller 45 of FIGURE 4 drives the restore signal REST high before VDD 
comes back to the corresponding logic module, the restore signal REST will not become 
active within that logic module until VDD comes back up to power the buffer tree that 
1 0 distributes REST throughout the logic module. 

Although exemplary embodiments of the invention are described above in detail, this 
does not limit the scope of the invention, which can be practiced in a variety of 
embodiments. 
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